EA-ConvNeXt: An Approach to Script Identification in Natural Scenes Based on Edge Flow and Coordinate Attention

نویسندگان

چکیده

In multilingual scene text understanding, script identification is an important prerequisite step for image recognition. Due to the complex background of images in natural scenes, severe noise, and common symbols or similar layouts different language families, problem has not been solved. This paper proposes a new method based on ConvNext improvement, namely EA-ConvNext. Firstly, generating edge flow map from original proposed, which increases number scripts reduces noise. Then, feature information extracted by convolutional neural network ConvNeXt, coordinate attention module proposed enhance description spatial position vertical direction. The public dataset SIW-13 expanded, Uyghur added, named SIW-14. improved achieved rates 97.3%, 93.5%, 92.4% datasets CVSI-2015, MLe2e, SIW-13, respectively, 92.0% expanded SIW-14, verifying superiority this method.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

the aesthetic dimension of howard barkers art: a frankfurtian approach to scenes from an execution and no end of blame

رابطه ی میانِ هنر و شرایطِ اجتماعیِ زایش آن همواره در طولِ تاریخ دغدغه ی ذهنی و دل مشغولیِ اساسیِ منتقدان و نیز هنرمندان بوده است. از آنجا که هنر در قفس آهنیِ زندگیِ اجتماعی محبوس است، گسترش وابستگیِ آن با نهاد ها و اصولِ اجتماعی پیرامون، صرفِ نظر از هم سو بودن و یا غیرِ هم سو بودنِ آن نهاد ها، امری اجتناب ناپذیر به نظر می رسد. با این وجود پدیدار گشتنِ چنین مباحثِ حائز اهمییتی در میان منتقدین، با ظهورِ مکتب ما...

Chromatic diversity index - an approach based on natural scenes

Common descriptors of light quality fail to predict the chromatic diversity produced by the same illuminant in different contexts such as images of natural scenes. The aim of this paper was to introduce a new index, capable of predicting illuminantinduced variations in the chromatic diversity off natural scenes. The spectral reflectance of each pixel of 50 images of natural scenes obtained usin...

متن کامل

Script Identification in Natural Scene Image and Video Frame using Attention based Convolutional-LSTM Network

Script identification plays a significant role in analysing documents and videos. In this paper, we focus on the problem of script identification in scene text images and video scripts. Because of low image quality, complex background and similar layout of characters shared by some scripts like Greek, Latin, etc., text recognition in those cases become challenging. Most of the recent approaches...

متن کامل

iranian english learners’ perception and personality: a dual approach to investigating influential factors on willingness to communicate

abstract previous studies on willingness to communicate (wtc) have shown the influence of many individual or situational factors on students’ tendency to engage in classroom communication, in which wtc has been viewed either at the trait-level or situational level. however, due to the complexity of the notion of willingness to communicate, the present study suggests that these two strands are ...

How to develop clinical reasoning in medical students and interns based on illness script theory: An experimental study

Background: Although theory explains the development of illness script, it does not provide answers how medical students develop scripts in their learning. To fill the knowledge gap of developing illness script in medical students and interns, this study aimed to investigate the impact of educational strategies inspired by theory in the development of illness scripts.    Methods: A total of 15...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Electronics

سال: 2023

ISSN: ['2079-9292']

DOI: https://doi.org/10.3390/electronics12132837